Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
JACC Cardiovasc Imaging ; 15(2): 193-208, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-34538625

RESUMO

OBJECTIVES: This study sought to identify homogenous echocardiographic phenotypes in community-based cohorts and assess their association with outcomes. BACKGROUND: Asymptomatic cardiac dysfunction leads to a high risk of long-term cardiovascular morbidity and mortality; however, better echocardiographic classification of asymptomatic individuals remains a challenge. METHODS: Echocardiographic phenotypes were identified using K-means clustering in the first generation of the STANISLAS (Yearly non-invasive follow-up of Health status of Lorraine insured inhabitants) cohort (N = 827; mean age: 60 ± 5 years; men: 48%), and their associations with vascular function and circulating biomarkers were also assessed. These phenotypes were externally validated in the Malmö Preventive Project cohort (N = 1,394; mean age: 67 ± 6 years; men: 70%), and their associations with the composite of cardiovascular mortality (CVM) or heart failure hospitalization (HFH) were assessed as well. RESULTS: Three echocardiographic phenotypes were identified as "mostly normal (MN)" (n = 334), "diastolic changes (D)" (n = 323), and "diastolic changes with structural remodeling (D/S)" (n = 170). The D and D/S phenotypes had similar ages, body mass indices, cardiovascular risk factors, vascular impairments, and diastolic function changes. The D phenotype consisted mainly of women and featured increased levels of inflammatory biomarkers, whereas the D/S phenotype, consisted predominantly of men, displayed the highest values of left ventricular mass, volume, and remodeling biomarkers. The phenotypes were predicted based on a simple algorithm including e', left ventricular mass and volume (e'VM algorithm). In the Malmö cohort, subgroups derived from e'VM algorithm were significantly associated with a higher risk of CVM and HFH (adjusted HR in the D phenotype = 1.87; 95% CI: 1.04 to 3.37; adjusted HR in the D/S phenotype = 3.02; 95% CI: 1.71 to 5.34). CONCLUSIONS: Among asymptomatic, middle-aged individuals, echocardiographic data-driven classification based on the simple e'VM algorithm identified profiles with different long-term HF risk. (4th Visit at 17 Years of Cohort STANISLAS-Stanislas Ancillary Study ESCIF [STANISLASV4]; NCT01391442).


Assuntos
Ecocardiografia , Insuficiência Cardíaca , Idoso , Feminino , Insuficiência Cardíaca/diagnóstico por imagem , Insuficiência Cardíaca/epidemiologia , Humanos , Incidência , Aprendizado de Máquina , Masculino , Pessoa de Meia-Idade , Fenótipo , Valor Preditivo dos Testes , Prognóstico , Volume Sistólico , Função Ventricular Esquerda
2.
Sci Rep ; 11(1): 4202, 2021 02 18.
Artigo em Inglês | MEDLINE | ID: mdl-33603019

RESUMO

The choice of the most appropriate unsupervised machine-learning method for "heterogeneous" or "mixed" data, i.e. with both continuous and categorical variables, can be challenging. Our aim was to examine the performance of various clustering strategies for mixed data using both simulated and real-life data. We conducted a benchmark analysis of "ready-to-use" tools in R comparing 4 model-based (Kamila algorithm, Latent Class Analysis, Latent Class Model [LCM] and Clustering by Mixture Modeling) and 5 distance/dissimilarity-based (Gower distance or Unsupervised Extra Trees dissimilarity followed by hierarchical clustering or Partitioning Around Medoids, K-prototypes) clustering methods. Clustering performances were assessed by Adjusted Rand Index (ARI) on 1000 generated virtual populations consisting of mixed variables using 7 scenarios with varying population sizes, number of clusters, number of continuous and categorical variables, proportions of relevant (non-noisy) variables and degree of variable relevance (low, mild, high). Clustering methods were then applied on the EPHESUS randomized clinical trial data (a heart failure trial evaluating the effect of eplerenone) allowing to illustrate the differences between different clustering techniques. The simulations revealed the dominance of K-prototypes, Kamila and LCM models over all other methods. Overall, methods using dissimilarity matrices in classical algorithms such as Partitioning Around Medoids and Hierarchical Clustering had a lower ARI compared to model-based methods in all scenarios. When applying clustering methods to a real-life clinical dataset, LCM showed promising results with regard to differences in (1) clinical profiles across clusters, (2) prognostic performance (highest C-index) and (3) identification of patient subgroups with substantial treatment benefit. The present findings suggest key differences in clustering performance between the tested algorithms (limited to tools readily available in R). In most of the tested scenarios, model-based methods (in particular the Kamila and LCM packages) and K-prototypes typically performed best in the setting of heterogeneous data.

3.
Sci Data ; 7(1): 3, 2020 01 02.
Artigo em Inglês | MEDLINE | ID: mdl-31896797

RESUMO

Pharmacogenomics (PGx) studies how individual gene variations impact drug response phenotypes, which makes PGx-related knowledge a key component towards precision medicine. A significant part of the state-of-the-art knowledge in PGx is accumulated in scientific publications, where it is hardly reusable by humans or software. Natural language processing techniques have been developed to guide experts who curate this amount of knowledge. But existing works are limited by the absence of a high quality annotated corpus focusing on PGx domain. In particular, this absence restricts the use of supervised machine learning. This article introduces PGxCorpus, a manually annotated corpus, designed to fill this gap and to enable the automatic extraction of PGx relationships from text. It comprises 945 sentences from 911 PubMed abstracts, annotated with PGx entities of interest (mainly gene variations, genes, drugs and phenotypes), and relationships between those. In this article, we present the corpus itself, its construction and a baseline experiment that illustrates how it may be leveraged to synthesize and summarize PGx knowledge.


Assuntos
Curadoria de Dados , Farmacogenética , Aprendizado de Máquina Supervisionado , Humanos , PubMed
4.
J Biomed Semantics ; 8(1): 16, 2017 Apr 20.
Artigo em Inglês | MEDLINE | ID: mdl-28427468

RESUMO

BACKGROUND: A standard task in pharmacogenomics research is identifying genes that may be involved in drug response variability, i.e., pharmacogenes. Because genomic experiments tended to generate many false positives, computational approaches based on the use of background knowledge have been proposed. Until now, only molecular networks or the biomedical literature were used, whereas many other resources are available. METHOD: We propose here to consume a diverse and larger set of resources using linked data related either to genes, drugs or diseases. One of the advantages of linked data is that they are built on a standard framework that facilitates the joint use of various sources, and thus facilitates considering features of various origins. We propose a selection and linkage of data sources relevant to pharmacogenomics, including for example DisGeNET and Clinvar. We use machine learning to identify and prioritize pharmacogenes that are the most probably valid, considering the selected linked data. This identification relies on the classification of gene-drug pairs as either pharmacogenomically associated or not and was experimented with two machine learning methods -random forest and graph kernel-, which results are compared in this article. RESULTS: We assembled a set of linked data relative to pharmacogenomics, of 2,610,793 triples, coming from six distinct resources. Learning from these data, random forest enables identifying valid pharmacogenes with a F-measure of 0.73, on a 10 folds cross-validation, whereas graph kernel achieves a F-measure of 0.81. A list of top candidates proposed by both approaches is provided and their obtention is discussed.


Assuntos
Biologia Computacional/métodos , Aprendizado de Máquina , Farmacogenética , Web Semântica , Gráficos por Computador , Mineração de Dados , Fenótipo
5.
Stud Health Technol Inform ; 210: 702-6, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25991243

RESUMO

UNLABELLED: Reusing medication data is crucial for many medical research domains. Semantic integration of such data in clinical data warehouse (CDW) is quite challenging. Our objective was to develop a reliable and scalable method for integrating prescription data into EHOP (a French CDW). METHOD: PN13/PHAST was used as the semantic interoperability standard during the ETL process, and to store the prescriptions as documents in the CDW. Theriaque was used as a drug knowledge database (DKDB), to annotate the prescription dataset with the finest granularity, and to provide semantic capabilities to the EHOP query workbench. RESULTS: the system was evaluated on a clinical data set. Depending on the use case, the precision ranged from 52% to 100%, Recall was always 100%. CONCLUSION: interoperability standards and DKDB, document approach, and the finest granularity approach are the key factors for successful drug data integration in CDW.


Assuntos
Bases de Dados de Produtos Farmacêuticos , Registros Eletrônicos de Saúde/organização & administração , Sistemas de Registro de Ordens Médicas/organização & administração , Registro Médico Coordenado/métodos , Sistemas de Medicação no Hospital/organização & administração , Semântica , Armazenamento e Recuperação da Informação/métodos , Processamento de Linguagem Natural , Integração de Sistemas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...